A Multilingual Phonological Resource Toolkit for Ubiquitous Speech Technology
Abstract
This paper outlines the generation process of a specific computational linguistic representation termed the Multilingual Time Map, conceptually a multi-tape finite state transducer encoding linguistic data at different levels of granularity. The first component acquires phonological data from syllable-labeled speech data; the second defines feature profiles; the third generates feature hierarchies and augments the acquired data with the defined feature profiles; and the fourth displays the Multilingual Time...
Similar resources
A Platform for Multilingual Research in Spoken Dialogue Systems
Multilingual speech technology research would be greatly facilitated by an integrated and comprehensive set of software tools that enable research and development of core language technologies and interactive language systems in any language. Such a multilingual platform has been one of our goals in developing the CSLU Toolkit. The Toolkit is composed of components that are essentially language...
Generic Techniques for Multilingual Speech Technology Applications
This paper is concerned with generic techniques for representing and evaluating phonological information in multilingual speech technology applications. A computational linguistic model of phonological interpretation is enhanced by a framework for constructing and evaluating phonotactic automata and by a generic lexicon model. The techniques make way for the extension of current speech technolo...
Multilingual text analysis for text-to-speech synthesis
We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, ...
Multilingual phonological analysis and speech synthesis
We give an overview of multilingual speech synthesis using the IPOX system. The first part discusses work in progress for various languages: Tashlhit Berber, Urdu and Dutch. The second part discusses a multilingual phonological grammar, which can be adapted to a particular language by setting parameters and adding language-specific details.
PhonVoc: A Phonetic and Phonological Vocoding Toolkit
We present the PhonVoc toolkit, a cascaded deep neural network (DNN) composed of a speech analyser and a synthesizer that use a shared phonetic and/or phonological speech representation. The free toolkit is distributed as open-source software under a BSD 3-Clause License, available at https://github.com/idiap/phonvoc with the pre-trained US English analysis and synthesis DNNs, and thus it is ready...